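A basic premise in graph signal processing (GSP) is that a graph encoding pairwise (anti-)correlations of the target signal as edge weights is used for graph filtering. However, existing fast graph sampling schemes are designed and tested only for positive graphs describing positive correlations. In this paper, we show that for datasets with strong inherent anti-correlations, a suitable graph contains both positive and negative edge weights. In response, we propose a linear-time signed graph sampling method centered on the concept of balanced signed graphs. Specifically, given an empirical covariance data matrix $\bar{\mathbf{C}}$, we first learn a sparse inverse matrix (graph Laplacian) $\mathcal{L}$ corresponding to a signed graph $\mathcal{G}$. We define the eigenvectors of the Laplacian $\mathcal{L}_B$ of a balanced signed graph $\mathcal{G}_B$, which approximates $\mathcal{G}$ via edge weight augmentation, as graph frequency components. Next, we choose samples to minimize the low-pass filter reconstruction error in two steps. We first align all Gershgorin disc left-ends of the Laplacian $\mathcal{L}_B$ at the smallest eigenvalue $\lambda_{\min}(\mathcal{L}_B)$ via the similarity transform $\mathcal{L}_p = \mathbf{S} \mathcal{L}_B \mathbf{S}^{-1}$, leveraging a recent linear algebra theorem called Gershgorin disc perfect alignment (GDPA). We then perform sampling on $\mathcal{L}_p$ using a previous fast Gershgorin disc alignment sampling (GDAS) scheme. Experimental results show that our signed graph sampling method noticeably outperforms existing fast sampling schemes on various datasets.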
translated by Google Translate
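The disc alignment step in the abstract above can be illustrated with a minimal sketch. It is not the authors' implementation: the 4-node balanced signed graph, its self-loop weights, the Laplacian convention (absolute-value degrees plus self-loops), and the helper names are all assumptions made for illustration, and the first eigenvector is obtained by plain power iteration rather than any method named in the source. With $\mathbf{S} = \mathrm{diag}(1/v_1, \ldots, 1/v_N)$, where $\mathbf{v}$ is the first eigenvector, every Gershgorin disc left-end of $\mathbf{S} \mathcal{L}_B \mathbf{S}^{-1}$ should coincide with $\lambda_{\min}$.

```python
import math

def left_ends(M):
    """Gershgorin disc left-ends: diagonal entry minus off-diagonal absolute row sum."""
    n = len(M)
    return [M[i][i] - sum(abs(M[i][j]) for j in range(n) if j != i) for i in range(n)]

def first_eigenpair(M, iters=5000):
    """Smallest eigenpair of symmetric M via power iteration on c*I - M."""
    n = len(M)
    # c is the largest Gershgorin right-end, an upper bound on all eigenvalues.
    c = max(M[i][i] + sum(abs(M[i][j]) for j in range(n) if j != i) for i in range(n))
    v = [1.0] + [0.0] * (n - 1)
    for _ in range(iters):
        w = [c * v[i] - sum(M[i][j] * v[j] for j in range(n)) for i in range(n)]
        norm = math.sqrt(sum(x * x for x in w))
        v = [x / norm for x in w]
    Mv = [sum(M[i][j] * v[j] for j in range(n)) for i in range(n)]
    lam = sum(v[i] * Mv[i] for i in range(n))  # Rayleigh quotient
    return lam, v

def align(M, v):
    """Similarity transform S M S^{-1} with S = diag(1 / v_i)."""
    n = len(M)
    return [[M[i][j] * v[j] / v[i] for j in range(n)] for i in range(n)]

# Hypothetical balanced signed graph: clusters {0, 1} and {2, 3},
# positive edges inside clusters, negative edges across them.
W = [[0.0, 1.0, 0.0, -0.8],
     [1.0, 0.0, -0.5, 0.0],
     [0.0, -0.5, 0.0, 1.0],
     [-0.8, 0.0, 1.0, 0.0]]
loops = [0.3, 0.0, 0.6, 0.1]  # assumed self-loop weights so the left-ends differ

n = len(W)
M = [[(sum(abs(w) for w in W[i]) + loops[i]) if i == j else -W[i][j]
      for j in range(n)] for i in range(n)]

lam, v = first_eigenpair(M)
before = left_ends(M)
aligned = left_ends(align(M, v))
print("lambda_min estimate:", lam)
print("left-ends before:", before)
print("left-ends after: ", aligned)
```

The alignment relies on the graph being balanced and connected: the first eigenvector then has no zero entries and its signs match the two clusters, which is what makes every transformed off-diagonal entry non-positive.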
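Predicting annual crop yield at county granularity is important for national food production and price stability. In this paper, toward the goal of better crop yield prediction, we leverage recent graph signal processing (GSP) tools to exploit the spatial correlation among neighboring counties: we denoise relevant features, which are inputs to a deep learning prediction model, via graph spectral filtering. Specifically, we first construct a combinatorial graph whose edge weights encode county-to-county similarities in soil and location features via metric learning. We then denoise the features via a maximum a posteriori (MAP) formulation using a graph Laplacian regularizer (GLR). We focus on the challenge of estimating, in an unsupervised manner, the crucial weight parameter $\mu$ that trades off the fidelity term against the GLR and that is a function of the noise variance. We first estimate the noise variance directly from noise-corrupted graph signals using a graph clique detection (GCD) procedure that discovers locally constant regions. We then compute an optimal $\mu$ by minimizing an approximate mean squared error function via bias-variance analysis. Experimental results on collected USDA data show that using the denoised features as input noticeably improves the performance of a crop yield prediction model.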
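As a rough illustration of GLR-based denoising, the sketch below assumes the commonly used objective $\min_{\mathbf{x}} \|\mathbf{y} - \mathbf{x}\|_2^2 + \mu\, \mathbf{x}^\top \mathbf{L} \mathbf{x}$, whose minimizer solves $(\mathbf{I} + \mu \mathbf{L})\mathbf{x} = \mathbf{y}$. The abstract does not state the exact objective, so this form is an assumption, as are the 5-node path graph, the noisy signal, and the hand-picked value of `mu` (the paper estimates $\mu$ from the noise variance, which is not reproduced here).

```python
def solve(A, b):
    """Solve A x = b by Gaussian elimination with partial pivoting."""
    n = len(b)
    aug = [row[:] + [b[i]] for i, row in enumerate(A)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(aug[r][col]))
        aug[col], aug[pivot] = aug[pivot], aug[col]
        for r in range(col + 1, n):
            f = aug[r][col] / aug[col][col]
            for k in range(col, n + 1):
                aug[r][k] -= f * aug[col][k]
    x = [0.0] * n
    for i in range(n - 1, -1, -1):
        x[i] = (aug[i][n] - sum(aug[i][k] * x[k] for k in range(i + 1, n))) / aug[i][i]
    return x

def path_laplacian(n):
    """Combinatorial Laplacian L = D - W of an unweighted path graph (illustrative)."""
    L = [[0.0] * n for _ in range(n)]
    for i in range(n - 1):
        L[i][i] += 1.0
        L[i + 1][i + 1] += 1.0
        L[i][i + 1] -= 1.0
        L[i + 1][i] -= 1.0
    return L

def glr(L, x):
    """Graph Laplacian regularizer x^T L x, a measure of signal roughness."""
    n = len(x)
    return sum(x[i] * L[i][j] * x[j] for i in range(n) for j in range(n))

def glr_denoise(y, L, mu):
    """Closed-form minimizer of ||y - x||^2 + mu * x^T L x."""
    n = len(y)
    A = [[(1.0 if i == j else 0.0) + mu * L[i][j] for j in range(n)] for i in range(n)]
    return solve(A, y)

y = [1.0, 1.4, 0.7, 1.2, 0.9]  # hypothetical noisy feature on 5 neighboring counties
L = path_laplacian(len(y))
mu = 2.0                       # assumed value, chosen by hand for this sketch
x = glr_denoise(y, L, mu)
print("denoised:", x)
print("roughness before/after:", glr(L, y), glr(L, x))
```

A larger `mu` pulls neighboring values closer together, while `mu = 0` returns the observation unchanged, which is why the choice of $\mu$ matters so much in this formulation.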
Text-based personality computing (TPC) has attracted considerable research interest in NLP. In this paper, we describe 15 challenges that we consider deserving of the research community's attention. These challenges are organized by the following topics: personality taxonomies, measurement quality, datasets, performance evaluation, modelling choices, as well as ethics and fairness. In addressing each challenge, we not only combine perspectives from both NLP and the social sciences, but also offer concrete suggestions towards more valid and reliable TPC research.
Stance detection (SD) can be considered a special case of textual entailment recognition (TER), a generic natural language task. Modelling SD as TER may offer benefits like more training data and a more general learning scheme. In this paper, we present an initial empirical analysis of this approach. We apply it to a difficult but relevant test case where no existing labelled SD dataset is available, because this is where modelling SD as TER may be especially helpful. We also leverage measurement knowledge from social sciences to improve model performance. We discuss our findings and suggest future research directions.
Synergetic use of sensors for soil moisture retrieval is attracting considerable interest because different sensors offer complementary advantages. Integrating active, passive, and optical data could be a comprehensive way to exploit these advantages when preparing soil moisture maps. Typically, pixel-based methods are used for multi-sensor fusion. Since different applications need soil moisture maps at different scales, pixel-based approaches are limited for this purpose. Object-based image analysis, which uses an image object instead of a pixel, could help meet this need. This paper proposes a segment-based image fusion framework to evaluate the possibility of preparing a multi-scale soil moisture map from integrated Sentinel-1, Sentinel-2, and Soil Moisture Active Passive (SMAP) data. The results confirm that the proposed methodology improves soil moisture estimation at different scales by up to 20% compared with a pixel-based fusion approach.
Machine Learning (ML) technologies have been increasingly adopted in Medical Cyber-Physical Systems (MCPS) to enable smart healthcare. Assuring the safety and effectiveness of learning-enabled MCPS is challenging, as such systems must account for diverse patient profiles and physiological dynamics and handle operational uncertainties. In this paper, we develop a safety assurance case for ML controllers in learning-enabled MCPS, with an emphasis on establishing confidence in the ML-based predictions. We present the safety assurance case in detail for Artificial Pancreas Systems (APS) as a representative application of learning-enabled MCPS, and provide a detailed analysis by implementing a deep neural network for the prediction in APS. We check the sufficiency of the ML data and analyze the correctness of the ML-based prediction using formal verification. Finally, we outline open research problems based on our experience in this paper.
Transformers have become central to recent advances in computer vision. However, training a vision Transformer (ViT) model from scratch can be resource intensive and time consuming. In this paper, we explore approaches to reduce the training costs of ViT models. We introduce several algorithmic improvements that enable training a ViT model from scratch with limited hardware (1 GPU) and time (24 hours) resources. First, we propose an efficient approach to add locality to the ViT architecture. Second, we develop a new image size curriculum learning strategy, which reduces the number of patches extracted from each image at the beginning of training. Finally, we propose a new variant of the popular ImageNet1k benchmark by adding hardware and time constraints. We evaluate our contributions on this benchmark and show that they significantly improve performance under the proposed training budget. We will share the code at https://github.com/BorealisAI/efficient-vit-training.
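To see why an image size curriculum lowers the early training cost, the sketch below counts the patches a ViT would extract as the input resolution grows. The linear schedule, the 128 to 224 pixel range, the patch size of 16, and the function names are hypothetical choices for illustration; the abstract does not specify the actual schedule.

```python
def image_size_at(epoch, total_epochs, start=128, end=224, patch=16):
    """Hypothetical linear schedule, rounded to a multiple of the patch size."""
    frac = epoch / max(total_epochs - 1, 1)
    size = start + frac * (end - start)
    return int(round(size / patch)) * patch

def num_patches(size, patch=16):
    """Number of non-overlapping patch tokens for a square image."""
    return (size // patch) ** 2

total_epochs = 10
sizes = [image_size_at(e, total_epochs) for e in range(total_epochs)]
for epoch, size in enumerate(sizes):
    print(f"epoch {epoch}: {size}x{size} px, {num_patches(size)} patches")
```

Because self-attention cost grows with the number of tokens, starting at a smaller resolution means the first epochs process far fewer patches per image than the final ones.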
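In the medical sciences, collecting multiple kinds of data on different diseases is very important, and the most important purpose of such data is to investigate diseases. Myocardial infarction is a serious risk factor for mortality, and previous studies have mainly focused on measuring the probability of myocardial infarction using demographic characteristics, echocardiography, and electrocardiography. In contrast, the aim of the present study is to apply data analysis algorithms and compare their accuracy on patients with heart attack, in order to predict the strength of the heart muscle during myocardial infarction while taking emergency operations into account. For this purpose, data on 105 myocardial infarction patients, with features including age, time of emergency operation, creatine phosphokinase (CPK) test, heart rate, blood sugar, and vein, were collected and studied using classification techniques from data analysis, including random decision forest, decision tree, support vector machine (SVM), k-nearest neighbors, and ordinal logistic regression. Finally, the random decision forest model, with an accuracy of 76%, was selected as the best model in terms of the mean evaluation indicator. In addition, seven features (creatine phosphokinase test, urea, white and red blood cell counts, blood sugar, time, and hemoglobin) were identified as the most effective features for the ejection fraction variable.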
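This paper presents Okapi, a new dataset for natural language to executable web Application Programming Interfaces (NL2API). The dataset is in English and contains 22,508 questions and 9,019 unique API calls, covering three domains. We define new compositional generalization tasks for NL2API that explore a model's ability to extrapolate from simple API calls in the training set to new and more complex API calls at inference time. In addition, models are required to generate API calls that execute correctly, in contrast to existing approaches that evaluate queries with placeholder values. Our dataset differs from most existing compositional semantic parsing datasets because it is a non-synthetic dataset that studies compositional generalization in a low-resource setting. Okapi is a step towards creating realistic datasets and benchmarks for studying compositional generalization, alongside the existing datasets and tasks. We report the generalization capabilities of sequence-to-sequence baseline models trained on a variety of SCAN and Okapi dataset tasks. The best model achieves 15\% exact-match accuracy when generalizing from simple API calls to more complex API calls. This highlights some challenges for future research. The Okapi dataset and tasks are publicly available at https://aka.ms/nl2api/data.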
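This paper covers n-tuple-based reinforcement learning (RL) algorithms. We present new algorithms for TD-, SARSA- and Q-learning that work seamlessly on various games with an arbitrary number of players. This is achieved by taking a player-centered view in which each player propagates his/her rewards back to previous rounds. We add a new element, called Final Adaptation RL (FARL), to all of these algorithms. Our main contribution is to show that FARL is a vitally important ingredient for achieving success with the player-centered view in various games. We report results on seven board games with 1, 2 and 3 players, including Othello, ConnectFour and Hex. In most cases, FARL is found to be important for learning a near-perfect playing strategy. All algorithms are available in the GBG framework on GitHub.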
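For readers unfamiliar with the Q-learning update that these algorithms build on, the following is a minimal tabular sketch on a hypothetical single-player chain environment. It is generic textbook Q-learning, not the n-tuple function approximation, the player-centered multi-player view, or FARL described in the abstract, and it is unrelated to the GBG code. To stay deterministic it sweeps over every state-action pair instead of sampling episodes.

```python
N_STATES = 5        # states 0..4 on a line (hypothetical toy environment)
TERMINAL = 4        # reaching state 4 ends the episode with reward 1
ACTIONS = (-1, +1)  # action index 0 moves left, index 1 moves right

def step(state, action_idx):
    """Deterministic transition; moving left from state 0 stays in place."""
    nxt = min(max(state + ACTIONS[action_idx], 0), TERMINAL)
    reward = 1.0 if nxt == TERMINAL else 0.0
    return nxt, reward

def q_learning(sweeps=500, alpha=0.5, gamma=0.9):
    """Tabular Q-learning with full sweeps over all state-action pairs."""
    Q = [[0.0, 0.0] for _ in range(N_STATES)]
    for _ in range(sweeps):
        for s in range(TERMINAL):
            for a in range(len(ACTIONS)):
                nxt, r = step(s, a)
                # A terminal successor contributes no bootstrapped value.
                bootstrap = 0.0 if nxt == TERMINAL else max(Q[nxt])
                Q[s][a] += alpha * (r + gamma * bootstrap - Q[s][a])
    return Q

Q = q_learning()
greedy = [max(range(len(ACTIONS)), key=lambda a: Q[s][a]) for s in range(TERMINAL)]
print("Q-values:", Q)
print("greedy action index per state:", greedy)
```

In this toy setting the reward is only observed at the final transition, so earlier states learn their values purely through bootstrapping from later ones, which is the same backward propagation of rewards that the abstract extends to multiple players.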